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Abstract. We introduce equivariant tree models in algebraic statistics, which 
unify and generalise existing tree models such as the general Markov model, 
the strand symmetric model, and group-based models such as the Jukes-Cantor 
and Kimura models. We focus on the ideals of such models. We show how the 
ideals for general trees can be determined from the ideals for stars. A corollary 
of theoretical importance is that the ideal for a general tree is generated by 
the ideals of its Battenings at vertices. The main novelty is that our results 
yield generators of the full ideal rather than an ideal which only defines the 
model set-theoretically. 



1. Set-up and theorems 

In phylogenetics, tree models have been introduced to describe the evolution of a 
number of species from a distant common ancestor. Given suitably aligned strings 
of nucleotides of n species alive today, one assumes that the individual positions 
in these strings have evolved independently and according to the same statistical 
process. Counting and averaging thus yields an empirical probability distribution 
on the set {A, C, G, T}". On the other hand, any hypothetical evolutionary tree 
whose leaves correspond to the n species gives rise to a parameteriscd family of 
probability distributions on C, G, T}"; see Section [3] for details. Here the pa- 
rameters consist of an initial distribution and transition matrices along the edges 
of the tree. The challenge is to test whether the tree fits the data, that is, whether 
the empirical distribution lies in the family. One way to do this is the use of phy- 
logenetic identities, equations that vanish identically on the family. This paper 
is concerned with constructing such identities for general trees from identities for 
smaller trees. The set-up below unifies and generalises existing tree models in al- 
gebraic statistics, while allowing for a clean and elegant treatment with methods 
from classical invariant theory. For more information on algebraic statistics and its 
applications see [71 [15] and the many references there. 

Remark 1.1. The term phylogenetic invariants is commonly used for phylogenetic 
identities. To avoid confusion with the invariants in classical invariant theory, we 
do not use this term. 

First, recall that a tree T is a connected, undirected graph without circuits; all 
our trees are finite and have at least two vertices. The valency of a vertex of T is 
the number of edges containing it. A vertex of T is called a leaf if it has valency 
1, and an internal vertex otherwise; if p, q are vertices, we write p ^ q if there is 
an edge connecting them. We write vertex(T), leaf (T), internal(r) for the sets of 
vertices, leaves, and internal vertices of T, respectively. Stars are trees of diameter 
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at most 2, and a centre of a star is a vertex at distance 1 to all other vertices — so 
if the star has more than 2 vertices, then its centre is unique. A subtree of a tree 
is a connected induced subgraph, and a substar is a subtree which itself is a star. 
So the map that sends a vertex p of T to the induced subgraph on p together with 
its neighbours is a bijection between vertex(T) and the set of substars of T, except 
when T consists of a single edge. In all that follows, we work over a ground field 
K that is algebraically closed and of characteristic zero. 

Definition 1.2. A spaced tree T is given by the following data: First, a finite 
undirected tree, also denoted T; second, for every p G vertex(T) a finite-dimensional 
vector space Vp] third, a non-degenerate symmetric bilinear form (. | .)p on each 
Vp] and fourth, for every p € internal(T) a distinguished basis Bp of Vp which is 
orthonormal with respect to (. | .)p. 

The space Vp at a, leaf p may also be given a distinguished basis Bp, orthonormal 
with respect to (. | .)p, in which case p is called a based leaf. An internal vertex of 
T will also be called based. Any subtree of the underlying tree of T is regarded as 
a spaced tree with the data that it inherits from T. 

Note that there is some redundancy in this definition: given the distinguished 
basis Bp at a based vertex p one could define (. | .)p by the requirement that Bp 
be orthonormal. We will leave out the subscript p from the bilinear form when it 
is obvious from the context. In many applications in algebraic statistics, symmetry 
is imposed on the algebraic model. This notion is captured well by the following 
notion of a G-spaced tree. Fix, once and for all, a finite group G. 

Definition 1.3. A G-spaced tree (or G-tree, for short) is a spaced tree T in which 
the space Vp at every vertex p is a G-module, on which (. | .)p is G-invariant, and in 
which Bp is G-stable whenever p is a based vertex. Any subtree of the underlying 
tree of T is regarded as a G-spaced tree with the data that it inherits from T. 

The objects that parameterise probability distributions in the algebraic model 
are what we propose to call representations of spaced trees. 

Definition 1.4. Let T be a spaced tree. A representation of T is a collection of 
tensors {Aqp)q^p £ Vq ® Vp along the edges of T with Apq — Aqp, where •* is the 
natural isomorphism Vp (E) Vq Vq (E)Vp. The space of all representations of T is 
denoted rep(r). A representation of T gives rise to a representation of any subtree 
of T. 

For a G-spaced tree T, a G -representation or equivariant representation of T is a 
representation {Aqp)p^q where each Aqp is a G-invariant element in the G-module 
Vq(EVp. The space of such representations is denoted rep(5{r). A G-representation 
of T gives rise to a G-representation of any subtree of T. 

Using the bilinear form (.|.)p we may identify Vp with its dual V*, and hence 
Vq(EVp with VqC^Vp = IIom(V^, Vq). Thus viewing Aqp as a linear map Vp ^ Vq 
and, similarly, Apq as a linear map Vq Vp, the condition Apq — A*p translates 
into {AqpV I w)q = {v I Apqw)p for all V &Vp and w ^Vq. Put yet differently, if p 
and q are both based, then this says that the matrix of Apq relative to the bases Bp 
and Bq is the transpose of the matrix of Aqp. In the applications to statistics, the 
spaces Hom(V^, Vq) or the space of \Bq\ x |i3p| -matrices are perhaps more natural to 
work with than Vq®Vp, as the elements of a representation correspond to transition 
matrices; see Section [3l However, there are good reasons to work with Vq®Vp; for 
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instance, the correet aetion of GL{Vp) x GL(yq) on the edge parameters turns out 
to be the natural action on Vq(^Vp rather than that on Hom(Vij, Vp); see Lemma ISTTl 
and the proof of Proposition [231 Finally we note that if T is a G-tree, then by the 
invariance of the bilinear form (.|.)p the identifications above still make sense when 
passing to G-invariant elements: {Vq Vp)'~^ = HomG(V^, Vq), etc. 

Notice the slight discrepancy between our notion of representations and the no- 
tion in quiver representation theory, where — apart from the fact that the underlying 
graph is directed — the spaces Vp form part of the data comprising a representation. 

A T-tensor is any element of (S'peicaf(T) which space we will denote by L{T) 
throughout the text. T-tensors correspond to marginal probability distributions in 
statistics; see Section [3l An important operation on spaced trees, representations, 
and T-tensors is *, defined as follows. Given k spaced trees Ti, . . . , whose vertex 
sets share a common based leaf q with common space Vq and common basis Bq but 
which trees are otherwise disjoint, we construct a new spaced tree *iTi obtained 
by gluing the Ti along g, while the space at a vertex p of *iTi coming from Ti is 
just the space attached to it in T^, with the same distinguished bilinear form, and 
the same basis if p is based. Given representations G rep(Ti) for i = 1, . . . , /c, 
we write -^lAi for the representation of *,jT!j built up from the Ai. Now let '^i be a 
Ti-tensor, for all i. Then we obtain a T-tensor by tensoring as follows: 

:== ®«(^ I 

where we abuse the notation (. | .) for the natural contraction 

VqX ^ Vp^ (g) Vp 

p6lcaf(Ti) )9Glcaf(T0\{9} 

determined by the bilinear form (. | .)q. Notice that this * operator is not a binary 
operator extended to several factors; nevertheless, when convenient, we will write 
Ti * • ■ • * Tfc for *iTi and * ■ • • * for 

Now we come to a fundamental procedure that associates to any representation of 
a spaced tree T a T-tensor. Let A e rep(T). We proceed inductively. First, if T has 
a single edge {p, g}, then '^t{A) := Aqp, regarded as an element of L{T) — Vq®Vp. 
If T has more than one edge, then let q be any internal vertex of T. We can then 
write T = *pn^qTp, where Tp is the branch of T around q containing p, constructed 
by taking the connected component of T— g (the graph obtained from T by removing 
q and all edges attached to q) containing p, and reattaching q to p. 

The representation A induces representations Ap of the Tp, and by induction 
"i^TpiAp) has been defined. We now set 

^t{A) := *p^,*T,(Ap). 

A straightforward proof by induction shows that this is independent of the choice 
of q and that this formula is also valid if q is actually a leaf. Now we can define the 
key objects of this paper. 

Definition 1.5. Let T be a spaced tree. The general Markov model associated to 
T is the algebraic variety 

CVgm(T) := {*t(A) I A e rep(T)} C T(T), 

where the closure is taking in the Zariski topology. 
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Similarly, for a G-spaced tree T, the equivariant model associated to T is the 
algebraic variety 

CVem(T) {*t(A) I A e repG(r)} C L{T). 

Notice that a priori both the individual tensors and the varieties CVqm{T), 

CVem(T') depend on the bases Bq at internal vertices q. This is only natural, as in 
applications these bases have an intrinsic meaning; see Section [31 However, more 
can be said about this dependency; see Lemma [01 

To streamline our discussion, we will consider CVgm as the special case of CVem 
where G is trivial. An important goal in algebraic statistics is finding the ideal of 
all polynomials on the space L{T) that vanish on CVem{T). Our first result is a 
procedure for constructing these ideals from the ideals for substars of T. 

Theorem 1.6. For any G-spaced tree T, the ideal /(CVem(T')) can he expressed 
in the ideals /(CVem('S')) where S runs over the G-spaced substars of T with at 
least three leaves. In particular, for any spaced tree, the ideal o/ /(CVgm(T')) can 
be expressed in the ideals /(CVgm('S')) where S runs over the spaced substars of T 
with at least three leaves. 

This theorem is admittedly formulated somewhat vaguely. However, its proof 
in Section [5] gives rise to the explicit, recursive Algorithm 15.61 for determining 
/(CVem(T')) from the ideals /(CVem(5')); this justifies the present formulation. 

We now present a variant of Theorem 11.61 which is perhaps less useful for actual 
computations, but which is of fundamental theoretical interest. This variant uses a 
second important operation on spaced trees and leaf tensors, namely, flattening. Fix 
any vertex g in a spaced tree T, and define an equivalence relation on leaf (T) U {q} 
hy p = r il and only if either p ^ q ~ r oi p,r ^ q \ie in the same connected 
component of T — q. Construct a spaced star b^T as follows: First, the vertex set 
is the set leaf(T) U {q}/ = of equivalence classes, and the class of q is attached to 
all other classes by an edge. To the class C we attach the space Vc ■— 0pgc ^ 
equipped with the bilinear form inherited from the Vp, and if all p G C are based, 
then C is based with the tensor product of the bases Bp. This new spaced tree bqT 
is called the flattening of T at q. Note that we allow g to be a leaf of T, in which 
case \)qT has a single edge. The space L{T) of T-tensors is naturally identified 
with the space L{\)qT) of b^T-tensors, and expanding the definition of \1/t at q one 
readily finds that 

CVem(T) C CVEMlbqT) for all q. 
Our second main result shows that this characterises CVem(T')- 

Theorem 1.7. For any G-spaced tree T we have 

/(CVem(T)) = HCVemM). 

(36vcrtox(T) 

Remark 1.8. (1) If T has more than one edge, then it suffices to let q run 
over internal(T). 

(2) To avoid confusion we stress that bgT is not a substar of T, unless T itself 
is a star with centre q, in which case b^T = T. 

Many special cases of our main results are known in the literature. In particular 
[T] contains set-theoretic versions of our theorems for the general Markov model, and 
poses Theorem II .71 for the general Markov model as Conjecture 5. In [IB] the ideals 
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of equivariant models with G abelian and all Vp equal to the regular representation 
KG are determined, following ideas from [8 . An important observation that makes 
this feasible is that these varieties are toric; see also Section [S] Some more specific 
references to the literature may be found in the Section [Sj which explains the 
relevance of spaced trees and their representations to statistics. After that, in 
Section [4] we prove a key tool on multiplying varieties of matrices, which we then 
use in Section [5] to prove our main results. Finally, Section [6] contains a result on 
toricness of certain abelian equivariant models. 

2. Acknowledgments 

The first author thanks Seth SuUivant for his great EIDMA/DIAMANT course 
on algebraic statistics in Eindhoven. It was Seth who pointed out that a result like 
the one in Section |4] could be used to treat various existing tree models in a unified 
manner. 



3. Relevance to statistics 

In the applications of our results to algebraic statistics, the spaced tree T that 
we start with only has based vertices. Indeed, the bases Bp have some physical 
meaning. In phylogenetics, for instance, they are usually all equal to {A, C, G, T}, 
the building bricks for DNA. Furthermore, an internal vertex r is singled out as root, 
and the base field is K := <C^ M. An element of Vp which on the basis Bp has non- 
negative real coefficients that add up to 1 is regarded as a probability distribution 
on Bp] together they form the probability simplex A(V^) C Vp. A representation 
of T is called stochastic if all maps Aqp : Vp ^ Vq directed away from r satisfy 
Aqp/!\{Vp) C ^{Vq), which amounts to saying that the entries of Aqp, regarded as 
a matrix relative to the bases Bq and Bp, are real and non-negative and that Aqp 
has all column sums equal to 1. A root distribution vr e A(I^) and a stochastic 
representation A of T determine a probability distribution on npevcrtox(T) ^'^'^i 
by taking marginals, a distribution on npeioaf(T) which can be thought of as 
an element <^t{A, tt) of A(L(r)). Write T = Ti * • • ■ * at r and let Ai, . . . , 
be the induced representations on the T^. Then the distribution is 

$t(A, tt) = ^ (H ^){h I *Ti (Ai)) ■ • ■ (& I *T,(Afe)), 

which equals where A' G rep(r) is the (non-stochastic) representation 

obtained from A by composing a single Apr leading away from the root with the 
diagonal linear map Vr Vr determined by 6 i-^ 7r(&)&. We define the set 

CVGM(r,r) {$T(A,7r) | tt G A{Vr) and A e rep(r) stochastic.} 

A natural equivariant analogue of this for a G-tree T is 

CVem(7', ?■) := {$t(^, 71") I TT e A(Vr) G-invariant and A e repc.(T) stochastic}, 

but as the following examples from phylogenetics show it also makes sense to allow 
for arbitrary root distributions rather than G-invariant ones; see below how to 
handle these. 

Example 3.1. In all models below, the Bp are all equal to {A, G, G, T} and are all 
equipped with the same permutation action of some G. Recall that the nucleotides 
fall into two classes of bases, according to their chemical structure: the purines 
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Adenine and Guanine and the pyrimidines Cytosine and Thymine. This explains 
some of the choices in the fohowing models. All of them are equivariant models in 
our sense. The labels of these models are those used in [T51. 

(1) In the Jukes-Cantor model JC69 G = Sym{{A,C,G,T}) (or the alternat- 
ing group, which has exactly the same equivariant maps Vp Vp). One 
assumes a G-invariant root distribution — which in this case means that it 
is uniform. 

(2) In the Kimura model K80 G is the dihedral group generated by {A, C, G, T) 
and {A, G). It is the group of symmetries of the following square. 

A - G 

I I 
T - G 

Again, the root distribution is taken G-invariant, which means uniform. 

(3) In the Kimura model K81 G is the Klein 4-group and the root distribution 
is G-invariant (uniform). 

(4) In the strand- symmetric model CS05 G generated by the transpositions 
(A, G) and {G,T) and the root distribution is G-invariant. 

(5) In the HKY85 model G is as in the strand-symmetric model, but one allows 
for non-G- invariant root distributions. 

(6) In the Felsenstein model FBI G is the full symmetric (or alternating) group, 
and the root distribution arbitrary. 

The ideals of all these models were determined in [IJITB]. Moreover, |3J gives 
local equations at biologically meaningful points. 

Remark 3.2. A similar construction of tree models appears in [2]. There the 
spaces at all vertices are required to be the same space VF, and the tensors at 
the edges are allowed to vary in some fixed subspace W oiW ®W consisting of 
symmetric tensors. Otherwise the construction of the model is the same. We should 
mention that not all models obtained in this manner fit within our framework. For 
instance, the model where W is the entire space of symmetric tensors cannot be 
characterised as the set of G-invariant tensors vaW ®W for some group G acting 
on W . Our present approach does not apply to this setting. 

Similar to the observations in [1], as a consequence of the construction of CVEM(r), 
it is a closed cone (i.e. invariant under scalar multiplication in L(T)) and therefore 
uniquely defines a projective variety in P(L(T)), denoted P(CVEM(r)), and defined 
by the same ideal as CVem(7'). Notice that because the elements of CVem(T, r) 
have coordinate sum equal to 1, CVem(T, r) actually maps injectively into P(L(T)). 
The following proposition justifies our quest for the ideal I{CVem{T)): it con- 
tains all homogeneous polynomials vanishing on the statistically meaningful set 
CVem (T,r). 

Proposition 3.3. Provided that all Vp are non-zero, the image of the set CVem(T, r) 
in ¥{L(T)) is Zariski dense in the variety P(CVem(T)). 

Proof. First, the set of root distributions on Br is clearly Zariski-dense in the set 
of all (complex) tt £ Vr with X^bes ^ ^- Similarly, for a single edge pq 

pointing away from r, the stochastic matrices in HomG(T^,yg) are Zariski dense 
in the complex matrices in HomG(T^, Vq) with column sums 1. This follows from 
an explicit parameterisations of such equivariant stochastic matrices A: for every 
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b G Bp/G the 6-th column of A varies in a certain (scaled probability) simplex of 
dimension \Bq/Gb\ — 1, where Gh is the stabiliser of b in G. This simplex is dense 
in the subset of V^'' where the sum of the coordinates is 1. 

Next we claim that for A in an open dense subset of rep(3(T) we can write 
'^t{A) as s$t(^',7I') for some A' £ repQ(T) having column sums 1 and some 
TT with X^b"""!^) = 1- To see this, first fix a vertex q and take for every vertex 
p ^ q a copy of the torus (K*)^"^^, considered as the diagonal subgroup of GL{Vq) 
acting by multiplication by a scalar on the span of each G-orbit on Bq and hence 
centralising G on Vq. The copy for p acts on Hom(l^, V^) by g{A) := gA and on 
Kom{Vq, Vp) by g{A) := Ag. Note that this latter action is not the natural one on 
HomiVq^Vp), in which g would be replaced by its inverse, but that this action is 
the natural one on Vq (8> Vp] see also Lemma l5. II and the remarks preceding it. In 
particular, this action maps representations to representations. A straightforward 
computation shows that the subtorus 

Hg {{cp,i>)bGBjG,p~g ^ [](A-*)^'/G | J] Cp,, = 1 for all b e 

leaves ^"7- invariant. Now let A be any G-representation of T such that all column 
sums of all Aqp directed away from r are non-zero; this is an open dense condition 
on A. Given any non-root vertex q, by acting with Hg we can achieve that the Aqp 
leading away from r have column sums 1, while the map Aqp, where p is the parent 
of q relative to r, may not. If we do this for all non-root vertices in a bottom-up 
manner, and finally also for H^., then we achieve that all Aqp leading away from r 
have column sums 1, except for a single Apr; note that we have not altered '^t{A) 
in this process. Denote the column sums of Apr by {ab)bGB^- Dividing column b of 
Apr by (Tb gives a representation A' all of whose matrices leading away from r have 
column sums 1. Also, for A in an open dense subset, X^b'^b ~' ^ non-zero, and 
dividing cr by s gives a tt adding up to 1 such that s^{A' , tt) = ^{A). This proves 
the claim, and hence the proposition. □ 

As we saw in the examples above, one may want to allow arbitrary root dis- 
tributions, which are not necessarily G-invariant. More generally, one might want 
to allow the root distribution to vary in a certain self-dual submodule of Vr, and 
this would require only minor changes in the discussion that follows — but here we 
concentrate on the situation where all elements of (the probability simplex in) V- 
are allowed. We define the set 

CVem(T, Vr) {^t{A,tt) \ A £ repG(T) stochastic and tt G A(1^^)}. 

One can elegantly describe CVem(T, K-) as follows. Let T' be the spaced tree 
obtained from T by connecting a new vertex r' to the root r of T and attaching to r' 
the G-niodule Vr' := Vr, endowed with the same bilinear form. Then L{T') = Vr> ^ 
L{T), and since Vr' = V;, we may think of CVem(T') as a subset of Hom(l/,./, L(r)). 
Since it also consists of G- fixed points, and since this identification is G-equivariant, 
it is a subset of HomciVr' , L{T)). 

Proposition 3.4. The image of CYem{T, Vr) in V{L{T)) is a Zariski dense subset 
in the projective variety associated to the cone 



CVem(T')K.', 
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where we regard CVEM(r') as a subset of HouiciVr' , L{T)) . The ideal of this cone 
can be determined from the ideal of CYEmiT'). 

Proof. We have 

where A' is obtained from A by putting the identity ly^ along the edge rr' . This 
shows that ^t{A,tt) is contained in CVEM{T')Vr' . For the converse we reason as 
before: for A' in an open dense subset of repQ(T') we can write '$t'{A') as vjfj^, (A) 
where aU Aqp directed away from r have column sums equal to 1 except possibly 
for Ar'r- We have 

= ^(6|A,v^)(M *Ti(Ai))®...® (M *T„(A™)) 

where s is taken such that tt' := s^^Ar'r"^ has '^i,'n''{b) = 1. 

The proof of the last statement is deferred to the end of Section [5l □ 

4. Multiplying varieties of matrices 

In this section we derive a key tool that will be used in the proofs of our results. 
As before let K be an algebraically closed field of characteristic 0, and let f be a 
natural number. For k, 1 g N' let Mkj denote the space Mk^Ai x • • ■ x Mk^^it, where 
Mk i is the space of p x g-matrices over K. To formulate and prove our results 
in their full strength, it is convenient to use some notions from the language of 
schemes, for which we refer to The main point here is that we do not require 
ideals to be radical. 

Recall that if X is an afhne variety, then a closed subscheme 5* of X is given by 
an ideal / of the ring K[X] of regular functions on X: the underlying closed subset 
of X is the set of zeros of /, and the iiT-algebra associated to S is K[X]/I. We write 
I{S) for the ideal of the subscheme S. If X is a variety on which some group F 
acts, then S is called a F-subscheme if and only if I{S) C K[X] is F-stable. Finally, 
if / : X — > y is a map between varieties, inducing the pull back homomorphism 
/«: K[Y] K[X], and ii S C X is a subscheme, then the image scheme of S is 
defined as the scheme theoretic closure of f{S), i.e. the subscheme of Y defined by 
the ideal {f^)~-^{I{S)). By slight abuse of notation it is usually denoted as f{S). 
It is clear that if / is F-equivariant for some group F acting on X and Y, then 
the image of a F-subscheme is again a F-subscheme. Also notice that if 5 is a 
subvariety, i.e., if I{S) is radical, then so is f{S) — it is precisely the Zariski closure 
of the set-theoretic image of S under /. 

We now specialise to multiplying schemes of matrices. We write /k: Mk,i x 
Ml m — > Mic f„ for the multiplication and /i" for the co- multiplication, /!'(/) (A, B) = 
f{A ■ B). Given two subschemes V C Afk.i and W C Mi^rn, put 

V -W -.^ ^i{V,W). 

If V, W are subvarieties this is just 



{AB \A&V,B & W}, 
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where AB {AiBi, . . . , AtBt). In general, the underlymg topological space is still 
the closure of the set ^j,{V, W), but the ideal is {^J.*)-^{I{V x W)). 

The operation ■ is associative in the sense that, given a third subscheme U C 
Mj^k, one has 

U ■ {V -W) ^ {U -V) -w 

we therefore simply write U ■ V ■ W. 

Let GLi = GLjj x GLi^ x • • • x GL/^ C M\. Frequently the subschemes we are 
interested in will be invariant by left- or right-multiplication by GLi. In this context 
it is worth mentioning that a subscheme V C Afk.i is a GLi-subscheme, i.e., stable 
by right-multiplication, if and only if V ■ M\,\ — V. This follows from the fact that 
for any if-algebra A and any nonconstant / £ K[xi, . . . ,Xm], Spec^[xi, . . . ,Xm]f 
is dense in SpecA[a;i, . . . ,Xm], and thus V ■ GLi is (scheme theoretically) dense in 

V -Mx. 

Theorem 4.1. Let k, 1, m G N* and let V and W he subschemes of M\^,\ and Afi^m; 
respectively. Then we have 

(1) I(V ■ Mi,i • W) = I{V ■ Mi^^) + I{M],j ■ W). 

In the following proof we will use the First Fundamental Theorem in invariant 
theory, which describes the invariant polynomials of GL„ on a direct sum of copies 
of K"^ and the dual space {K")* . This theorem is due to Weyl ^7}. Another 
tool from representation theory is the Reynolds operator: when a reductive group 
acts rationally on a vector space, then the Reynolds operator p is the projection 
onto the invariant vectors with kernel the direct sum of all non-trivial irreducible 
submodulcs. Modern treatments on invariant theory are [51 [TOl [T^ . 

Proof. The inclusion D is obvious. To prove the opposite inclusion we first replace 

V hy V ■ Ml J and W by Mij • W — this clearly keeps invariant both sides of ([1]), and 
moreover turns V and W into GLi-subschemes, where GLi :— GLj^ x • • • x GLj^, 
which acts on Mk,i x M\^m by 

(51, . . . , gt)i{Ai, At), (Bi, . . . , Bt)) = {{Aig^\ Atg-'), (g.B,, gtBt)) 

Let / S I{V ■ W), which now equals the left-hand side of H]). Define h e A'[Afk.i x 
Mi.m] hy h = fJ-Hf), so that h{A, B) = f{AB). Then h is invariant with respect to 
the action of GLi, and moreover h is in the ideal of ^ x W . This latter fact implies 
that 

heI{V X Mi,^) -f /(Mk,i X W)] 

split h = hi + h2 accordingly. Applying the Reynolds operator p : K[M\^^\ x Mi^m] — > 
ii:[Mk,i X Mi^m]^^' yields h = p(hi) + pih^). By GLi-invariance of V and W, 
p{hi) and p{h2) still are elements of I{V x Afi^m) and /(Mk,i x W), respectively. 
Furthermore, p{hi) and p(/i2) lie in 

By the First Fundamental Theorem for GLi^ applied to ki covectors and rui vectors 
the puUback of multiplication Mk^j. x Mi.^m. — > Mfe..™. is a surjective homomor- 
phism 
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for all i ~ 1, . . . ,t. Hence the puUback of multiplication Mk,i x Mi^m -Mk,m is a 
surjective homomorpliism 

let p{hi),p{h2) lift under this surjection to hi,h2 G -?4r[Afk.m], respectively. Note 
that hi, h2 are not unique if h < min{A:i, rm} for some i, but this is irrelevant here. 
We now have hi e I(V ■ -M^m), h2 € /(Mk,i • W). Moreover, restricted to the image 
Afk.i • -^i,m we have f = hi + h2, since 

f{AB) = h{A, B) = p{hi){A, B) + p{h2){A, B) = hi{AB) + MAB) 

for all A e M]i^i,B E Mi.m- Hence /' := / — {hi + 112) vanishes on Afk.i • -A'/i,m, 
which contains both V ■ M\^m and Mk,i ■ W . But then /' lies in both liV ■ Mi.m) 
and /(Mk,i • W), and hence / G I{V ■ Mi,m) + /(Afk,i • W) as claimed. □ 

We will use Theorcm l4.1l to describe the ideal of V^-M^ explicitly from the ideals of 
V and W . Before we can do this we need one more tool. Suppose a: X x Y 
is a morphism where X and Y are affine varieties over K, and suppose S* is a closed 
subscheme of Y defined by an ideal /. Then there exists a uniquely determined 
subscheme S' of X such that a{S' x X") C S and such that 5" is maximal with 
this property. Scheme-theoretically, S' is equal to S' — HtigK" K^i^^^i^)) where 
for V e K'\ iy is the inclusion X ^ X x {v} C X x K"-. The ideal of 5" is 
determined as follows: Let /' be the ideal in K[X] generated by all functions of the 
form — a'^{f){-,v) = f{a{-,v)) where / £ / and v G is an arbitrary (closed) 
point. In other words, fy{x) = f{a{x, v)) for x € X. 

Since K[X x K"-] — K[X\ ®k K[xi,X2, ■ ■ ■ , Xn], for any / £ K[Y], we may write 
a'^if) uniquely as 

(2) = 

i 

where the nii are some linearly independent monomials in K[xi, X2, ■ ■ ■ , Xn], and 
hi G -ft'i-'^]. It is now elementary to check that /' is generated by all hi G K[X] 
that appear in such an expression ^ a.s f runs through /. Indeed, the ideal 
generated by where v ranges over if" is precisely the ideal generated by all 
hi,h2, ■ ■ ■ ,hs- This is easily seen by picking s points vi,V2, ■ ■ ■ ,Vs in such that 
det[mi{vj)] ^ 0, which is possible as K is infinite and the rrii are supposed to 
be linearly independent. This observation is important because it shows how to 
compute a finite list of generators for /' out of finitely many generators for /: if 
/i, /2, . . . , fm generate /, then the (finite) collection of all hiS appearing in one of 
the a^{fj)s generates /'. 

We will apply this construction to the case where X — M\^^\,K" — Mi^m and 
a — p equal to matrix multiplication. Then for V C Mk^mj the ideal I{V') is 
generated by all functions on Mkj of the form f{xB) where B G Mj m is arbitrary, 
and / G I{V). In fact, thinking of the entries of B as variables, we may expand 
f{xB) as a polynomial in the entries of B; the coefficients are then the required 
elements of the ideal of V . See also Example 16.41 

The following corollary is crucial for explicit computations; it is a slight gener- 
alisation of [H Lemma 12], which gives the same equations for V ■ Mi.m. Although 
there the result is stated only for subvarieties, their proof should also go through 
essentially unchanged. 
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Corollary 4.2. Let k, 1, m G N* and let V be a subscheme o/ A/k,i with V — V-M\^i. 
Define the scheme Ri C Afk,m by the ideal generated by all (h + l)-'minors of the 
i-th component, for all i — \, . . . ,t. Then we have 

(3) I{V ■ Mum) = I{V') + 

where V is defined as in the preceding paragraph as the unique maximal subscheme 
o/Mk,m such that V ■ M^,! C V. 

It is well known that Mk,i • = -Ri as schemes, and in particular that the 

ideal of R\ is radical. 

Proof The inclusion D follows from V ■ Mi^^ C y' n Rv First, V ■ A/i,^ C i?i = 
Mk,i • Mi,m is clear. Second, V ■ Mu = V implies that V ■ Mi.^ • Afm,i C V, i.e. 

V ■ Mi^m c y . 

For the opposite inclusion, set W :— Afm,i • Afi,m, and apply Theorem 14.11 with 
(k, 1, m, V, W) replaced by (k, m, m, V',W). Indeed, V ■ M^.i V hy definition 
of V, so that the left-hand side of ([3]) is contained in KV' ■ Mm,i ■ M\^m), which is 
the left-hand side of H]) with {V,W) replaced by {V',W). With this substitution 
the right-hand side of H]) reads 

HV ■ M^^^) + /(Mk^™ • W) 

which, as A/k.m -W — R\, equals the right-hand side of (l3|). □ 

The corollary is the reason why we had to use subschemes instead of subvarieties: 
in general, V is not a variety even if V is, so the ideal yJI{V') of functions vanishing 
on the closed points of V' may be larger than I{V'), and hard to compute. However, 
the corollary shows that to compute I{V ■ M\^m) only the ideal I{V') is needed, and 
for this ideal generators can be found as described above. 

We will apply Theorem 14 . 1 1 and its corollary in the following setting: Let V be 
a representation of G, O the set of all irreducible characters, and for w S SI denote 
by a fixed irreducible representation of type uj. Then V = @^^q_ where 
V[lo\ is the sum of all submodules of V isomorphic to M^. Moreover, V[ijj\ = 
® Romc{M^,V) ^ (g) K'^i'^'^^ with m{uj, V) the multiplicity of in V. 
In particular, if W is another representation of G, then the space of equivariant 
maps V ^ W is 

}iomGiV,W) ^ 0Hom(i4r'"("'^),ii'™('^''^)). 

The varieties we are interested in are subvarieties of IIom(3(y,VF) stable by 
multiphcation with EndG(^) or EndG(VF) where V, W are some representations of 
G. So let U, V, W be three representations of G, and suppose S C IIomG(?7, V) and 
T C }lomQ{V,W) are subvarieties or subschemes. To apply Theorem 14.11 we may 
identify HomG(l^, W) with Mk,i, RomdU, V) with Mi^^n, and EndoiV) with Mij, 
by putting li = m(wi,y),fci = m{LUi,W), and = m{u!i,U), respectively, where 

n = {UJI,UJ2, . ■ . , Wt}. 

With these identifications in place, the ideal of T ■ Endc^V) ■ S is equal to 
I{T ■ Afi.m) + -f(A/k,i • S). Similarly, if T is stable by right-multiplication with 
EndG(V, V), then the ideal of T-HomG(J7, V) may be computed using Corollarv l4.2l 
We will see several applications of this in the next section. 
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5. Proofs of the main results 

Before proving our main results, we investigate how and CVEM(r) behave 
under base changes. Thus let T be a G-tree and write GL^ for the product 
ripevertexCT) ^L(^)'^- '^^'^ hand, this group acts on TepQ{T) by the ac- 

tion of GL{Vq)'^ X GL(V^)'^ on {Vq (g) Vp)^. We already encountered a special case 
of this action in the proof of Proposition 13.31 On the other hand, given h G GL^ 
one can define a new G-spaced tree hT as follows: the underlying tree of hT is the 
same as that of T and the space Vp at each vertex is also the same as that of T, 
but the bilinear form is determined by 

{hpu\hpv)'p := {u\v)p for u,v d Vp, 

where the latter bilinear form is the one assigned to p in T. Finally, a vertex p is 
based in hT if and only if it is based in T, and then the basis associated to it in hT 
is Bp := hBp, where Bp is the distinguished basis of in T. A representation A of 
T gives a representation of hT, also denoted A, by simply taking the same tensors 
Aqp £VqiS)Vp along the edges. 

Lemma 5.1. In the setting above we have "S/fiTi^) = h'i'T{h~^A). 

Proof. If T has only two vertices p ^ q, then this just the obvious equality Aqp = 
[hq, hp){h~^ , hp^)Aqp. If T has more than two vertices, we pick any internal vertex 
g of r and split T — *iTi and A =^ *iAi at q. Assuming the result for all Tj we find 

= J2 (»^{b\h^^>TAh-'A,)y 

= J2 (g>^ihqb\h^Tdh-'A,)y 

beBg 

= /i ^ (»,{b\-^T,{h''A)) 

bGBg 

= h-^hTih-^A). 

□ 

In particular, this lemma implies that CVEM(r) — CVEM{hT). For G — {1} 
we note that if T' is any spaced tree with the same underlying tree as T and 
the same G-modules Vp at the vertices, but different bilinear forms and different 
(orthonormal) bases, then there exists an ft, G GLy with hT = T'. In this sense the 
variety CVgm(T') does not depend on the chosen bases and forms, as long as they 
are compatible. 

Remark 5.2. A stronger basis-independency holds at vertices of valency two. 
There the operation * boils down to matrix multiplication, or composition of linear 
maps, and this will enable us to apply Theorem 14. II Indeed, let U, V, W be vector 
spaces equipped with non-degenerate symmetric bilinear forms and let ^'i G VF®^^ 
and 5*2 G V ®U he arbitrary. Let B be any orthonormal basis of V. We claim that 
the element 

^(&|*i)®(6|*2) eW(g,U 

b£B 
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does not depend on B, and under the identification U ^ U* coincides with the 
hnear map U ^ W which is the composition oi '^i and ^2, considered as hnear 
maps under the identifications Wi^V = Wi^V*= Hom(y, W) and V 1^ U = 
V (E)U* = Hom(C/, V). It suffices to verify this for rank-one tensors = w^v and 
"^2 ^ v' (iSi u. The expression above is then 

[^{b\v){b\v')]w «) u. 

beB 

By the orthonormahty of B this reduces to {v\v')w ® u, as ciaimed. 

Thus, in hindsight, we could have left out the orthonormal bases at vertices 
of valency 2 in the definition of (G'-)spaced trees, and defined the operation * as 
composition of linear maps. We have not done so to keep the treatment of internal 
vertices uniform. 

Next we observe that the map ^'t defined in Section [1] behaves well with respect 
to the group action. Let T be a G-spaced tree. Note that G acts naturally on 
rep(r) by its action on each tensor product Vp i^Vq with p ^ q. 

Lemma 5.3. The map ^>t '■ rep(r) — > L(T) is G-equivariant. 

Proof. If T has exactly two vertices the assertion is immediate. Otherwise, let q be 
an inner vertex of T and split T around q into Ti , . . . , . The Ti are G-trees in a 
natural way. By induction, we may assume that ^'j'^ is an equivariant map. Then 

^-tM) = ^ (H «'TiMi)) ® • • • (6 I ^-T.Mfe)) 

beBg 

= ^ (6 I s^-TiCAi)) ® • • • ® (5 I g-^rMk)) 

beBg 

= 9J2 (5"'^ I *Ti(^i)) ® • ■ • (g'^b I g'^rMk)) 

b£Bg 

= g^T{A), 

where the second equality follows from the G-invariance of (. | .)q and the last 
equality follows from the fact that g~^ permutes Bq. □ 

This lemma implies that CVem(T) Q L{T)^. In what follows we focus on the 
ideal of CVem(T) inside K[LiT)^]. To obtain the ideal inside K[L{T)], one just 
adds linear equations cutting out L{T)'^ from L{T). 

Remark 5.4. For k = 2 the computation in the proof of the lemma can be replaced 
by the following argument, using the notation of Remark 15.21 If U, V, W are G- 
modules and V carries a G-invariant symmetric bilinear form, then the unique 
bihnear map W(^VxV(^U^W(^U sending {w v,v' ^ u) to {v\v')w ® u 
is G-equivariant. So at vertices of valency 2 it is not crucial that G permutes the 
basis. 

With these preparations, we are now ready to prove our first main result. 

Proof of Theorem \l.(A Let T be a G-spaced tree. We recursively express the ideal of 
CVEM(r) into the ideals of CVem('S') for substars S oiT with at least three leaves, 
as follows. First, if T has only two vertices p ^ q, then CVem(T) = {Vp (g) Vq)'-^ = 
L{T)° and we are done. Second, if T is itself a star with at least three leaves, then 
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we are also done. Third, suppose that T contains a vertex q of valency 2, and split 
T accordingly as T = Ti * so that 

cvEM(r) = {*i * *2 1 -J-, e cvEM(r,) for i = i, 2}. 

Let Li be the space 0pgioaf(Ti)\{q} and L2 = 0peioaf(T2)\q ^p- Of course 
Li,L2 are naturally G-representations. Now the map (5'i,^'2) 1-^ ^'i * 5*2 from 
L(Ti)'^ X L{T2)^ to L(r)'^ is just matrix multiplication if we identify L{Ti)^ with 
HomG(lp, -Z^i) and L{T2)'^ with HomG(L2, V"p); see Remark [Ol 

We want to apply Theorem 14.11 Recall the definition of uii and m{LUi,V) from 
Section m Now define k, 1, m by := m(uji,Li), li :— m(LUi,Vq) and rrii := 
miuj,,L2). Then HomG(V;,,Li) = Mk,i, HomG(L2,yq) = Af,,^ and Endcly,) = 
Mi,i. Notice that V := CVEM(ri) (resp. W := CVEM(r2)) are stable under right- 
(resp. left-) muhiphcation with Mij, and CVemC^) = V-W = V- Mi,i • W". Thus 
Theorem 14.11 applies and we deduce that 

/(cvem(t)) = /(cvEM(ri) * cvEM(b,r2)) + /(cvEM(b,ri) * cVem(T2), 

where CVEui^qTi) = HomG(ii, Vg) = HomG(V"g,Lj) ^ (L^ (g) Vq)'~^ because G acts 
preserving the form. Recursively, we may assume that the ideals of CVem(T'i) 
and CVem(T'2) have been computed. Finally, the two terms on the right-hand 
side can be expressed into /(CVem(T'i)) and /(CVem(J2)) using Corollarv 14.21 
Following the recipe at the end of Section [H we may compute e.g. /(CVem(T'i) * 
CVEM(t'qr2)) = /(CVem(Ti) • HomG(L2,K,)). This concludes the case where T 
contains a vertex of valency 2. 

Finally, if T is neither a star nor contains a vertex of valency two, then it contains 
an edge p ~ r where both p and r are internal vertices of valency at least three. 
Let T' be the G-tree obtained from T by inserting two vertices qi and 52 between 
p and r so that p ^ qi q2 ^ setting Vq-^ := Vr with the same bilinear form 
and basis, and Vq^ :— Vp with the same bilinear form and basis. Note that every 
G-spaced substar of T' with at least three leaves is also a G-spaced substar of 
T. This is why we inserted two vertices rather than one: what space should we 
attach to a single vertex between p and g? See below for a comment on this. By 
the previous construction, we can express the ideal of CVem(T") in the ideals of 
CVem(5') of all substars 5' of T', hence of T, with at least three leaves. So we are 
done if can show that CVem{T) = GYem{T'). But any A' G repG(T') gives rise 
to an A e repG(T) by setting Apr :— ^pqi^qiq2^q2r- Using Remark [5?2l one finds 
that ^'t(^) — '^T'{A'). Conversely, for any A € repQ{T) we can factorise Apr into 
A'pq_^A'q_^q^A'q^r witli G-iiivariaut factors. This gives a representation A' of T' with 
'i>T'{A') = 'i>T{A). This concludes the proof of the theorem. □ 

Remark 5.5. Note that for G = 1, i.e., for the general Markov model, the proof 
above can be simplified slightly: one does not need the decomposition into isotypic 
components, and may apply Theorem 14 . 1 1 wit h t — 1 directly. 

The proof above yields to the following algorithm for computing /(CVemI?")) 
from the ideals of substars. 

Algorithm 5.6. 

Input: a G-spaced tree T and finite generating sets of the ideals I{CVbm{S)) Q 

K[L(S)^] for all substars S" in T with at least three leaves. 
Output: a finite generating set of the ideal /(CVem(T)) C K[L{T)^]. 
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Procedure: 

(1) If T contains only two vertices p q, then return the empty set and 
quit. 

(2) If T is a star with at least three leaves, then a finite generating set of 
/(CVem(T')) is part of the input; return this set and quit. 

(3) If T has a vertex of valency 2, then choose such a vertex q and split 
T = Ti*T2 at q. Apply this algorithm to Ti and to T2 (with the ideals 
of their substars) to find finite generating sets Fi,F2 of the ideals of 
CVemI^i) and CVem(T'2), respectively. Let Li^L^ be as in the proof 
of theorem 1 1.61 identify L{T)'^ = HomG(Li,L2) with Mk,m as in that 
proof, and write 1 for the tuple of multiplicities •m{uji^Vq). Identify 
L{Ti) with Mk,i and L{T2) with Mi^m, so that Fi C fs:[Mk,i] and 
F2 C /^[Mi^m]- Write ^E" for an element in Afk.m whose coordinates 
are variables. 

(a) Let F' denote the collection of all (^^ + 1) x {li + l)-minors of 
the i-th component of 5* for alH = 1, . . . , i. 

(b) For an element 'I'o G Afm.i with new variables as coordinates, 
expand /(^f • '^0) for each / e -Fi, and take all coefficients of 
monomials in ^fo, which are polynomials in 4". Collect these 
polynomials in F[. 

(c) For an element ^"0 G -Mk,i with new variables as coordinates, 
expand /(^'o • ^) for each f G F2, and take all coefficients of 
monomials in ^'o, which are polynomials in Collect these 
polynomials in 

Return F' U F{ U F^ and quit. 

(4) Take an edge p ~ r in T connecting two vertices of valency at least 
three. Construct a G-tree T' by inserting two new vertices qi, (72 such 
that p qi q2 r and setting Vq-^ := Vr with the same basis and 
bilinear form and Vq^ := Vp with the same basis and bilinear form. 
Run this algorithm on T', return the same output as for T' , and quit. 

Although the tree grows in the last step, it is easy to see that this algorithm 
terminates: after inserting vertices, in the call with T' the tree is broken into two 
trees, each of which have strictly less substars with at least three vertices. This 
algorithm is partly carried out in Example 16.41 

Remark 5.7. In the last step of both the proof and the algorithm we could also 
have inserted a single vertex q between p and r, with Vq equal to the G-module 
having multiplicities m{uji, Vq) = min{m(a;i, Vp), m{uji, Vr)} for all i, so that all G- 
equivariant maps Vp ^ Vp factorise through Vq. One can show that this Vq carries 
a G-invariant, non-degenerate symmetric bilinear form since Vp and Vr do. This 
set-up would have raised two minor problems. First, the object T' thus constructed 
is strictly speaking not a G-tree, as Vq may not have an orthogonal basis permuted 
by G. But as we saw in Remarks l5.2l and l5.4l this is not really a problem: we can still 
apply Theorem 14.11 at q to split T' into smaller trees. Second, the G-spaced stars 
S'p, S'r with centres p and r in T' are not equal to the G-spaced stars Sp, Sr around 
p and r in T. Hence after expressing /(CVem(T')) in the ideals /(CVem(<S')) for 
all stars S with at least three leaves in T', we still need to express the ideals of 
CVem('S'p) and CVemC-S'^) in CVem{Sp) and CVEM(5'r.), respectively, to prove the 
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theorem. The following lemma does just that. This would give a slight variant of 
the algorithm above. 

Lemma 5.8. Let S' , S be G-spaced stars with the same underlying star having q as 
centre andpi, . . . ,pk as leaves. Suppose that both stars have the same space Vq with 
the same basis Bq and that we are given G-equivariant injections Ti : V^'. — > Vp- for 
i = 1, . . . ,k, where Vp. and Vp- are attached to pi in S' and S, respectively. Denote 
by T the induced injection L{S') — > L(S). Then 

/(CVem(^')) = r«/(CVEM(5)). 
In particular, a finite generating set for /(CVem('5')) gives a finite generating set 
for /(CVemI-S")) under pull-back by t. 

Proof. For the inclusion D note that any G-representation A ~ (Ap. g); of S gives 
rise to a representation A' = {TiAp._q)i of S' satisfying t'^ s{A) — 5's'(^')- Hence 
T maps CVem(S') into CVemCS"). 

For the inclusion C note that, as Vp. is a completely reducible G-module, there 
exist G-equivariant surjections tt^ : V^'. — > V^. with -KiTi = idvp.. Now the in- 
duced projection tt : L{S') L[S) maps CVem(5'') into CVemCS"), and if / G 
/(CVem(5)), then / = r«(^«/), where lies in /(CVem(^'))- □ 

Now we prove our second main result. 

Proof of Theorem \1.T\ Recall the statement of the theorem: for any G-tree T we 
have 

(*) /(CVem(T))= HCVemM). 

ij6vortex(T) 

We proceed by induction. First, the statement is a tautology for a star T. Next, 
suppose that T has an inner vertex q of valency 2 and split T — Ti *T2 si q. By 
induction we may assume that the theorem holds for Ti. The proof of Theorem ll.6l 
shows that 

/(CVem(T)) = /(CVem(Ti) * CVEM(bgT2)) + /(CVEM(b,Ti) * CVem(T2)), 

so it suffices to prove that each of these terms is contained in the right-hand side of 
0; we do so for the first term. We use the notation Li, ^2, k, 1, m from the proof 
of TheoremHH By Corollary the ideal of CVem(Ti) * CVEM(bgT2) is spanned 
by polynomials in 'J G L{T) = HomG(i2, Li) of the following two forms: 

(1) for alH = 1, . . . , t the [li + l)-minors of the i-th component of regarded 
as an element of Afk,m; and 

(2) all polynomials of the form * i-^ /(^^fo), where / G I{CY-em{Ti)) and *o 
is some element of HomG(V^, L2). 

The first type of elements are clearly equations for CYgm^!;^ qT), so we need only 
worry about the second type of equations. By induction we may assume that / 
is an equation for CYcM^rTi) for some vertex r of Ti. But then '5 1-^ /(^'^'o) 
vanishes on CVGM(t'r-T'), and we are done. 

Finally, if T is not a star and does not contain a vertex of valency 2, then we 
proceed as in the proof of Theorem II. 61 We choose an edge p r in T where both 
p and r have valency at least three, and insert vertices qi , (72 with p ^ qi ^ q2 ^ r 
and Vgj := K- and Vq.^ := Vp to obtain a new G-spaced tree T' . We claim that 
both sides of Q remain unchanged in replacing T by T'. For the left-hand side 
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this was proved in the proof of Theorem II. 61 The right-hand side gains two terms, 
namely, /(CVEM(bqiT')) and /(CVEM(t'g2?"))- However, the definition of flatten- 
ing readily implies that CVEMlt-pT) C CVEMlt-g^T'), so that /(CVEM(bq2T')) C 
/(CVEMlt-pT)). Similarly, we find /(CVEMlt-qiT')) C /(CVEM(t'rT)), and hence 
the two extra terms on the right-hand side of (Q are redundant. Now the theorem 
for T follows from that for T' , which in turn follows by induction as in the previous 
case. □ 



Finally, we prove the last statement of Proposition [3?4l which says that GW'EM{T')Vr' 
can be computed from the ideal of CVEM(r'). 

Proof of Proposition \3.4\ We will apply Corollary 14.21 where CVem(I") will play 
the role of V and Vr' will play the role of Mi^m- To this end we proceed as in the 
proof of Theorem 11.61 First set L := (S'pgicaf(T')\{r'} ^ ^i' ■ • ■ be the 

characters of G. Next define k, 1, m as follows: ki := ■m{uji, L), li :— m{uji, Vr'), and 
m,j :— ■m{u!i, KG) with KG the left regular representation of G. View CVemC?^') 
as a subvariety of HoniQlVr' , L), which can be thought of as M^j, and view Vr' 
as the space Hoirq (KG,Vr') under the isomorphism i— > i— » gv), which can be 
thought of as Mi^m- It is easy to see that CVem(T'') is closed under composition 
with Home (14', Vr'), which is M\^\, so that Corollarv l4. 21 applies. We conclude that 
I {CVEM{T')Vr') , regarded as a subset of Mk,m is generated by the rank-(/i + 1)- 
minors of the i-th block for i = 1, . . . ,t and the polynomials L K oi the form 
vj/ /(^'^'o), where is regarded as a G-homomorphism KG L, f runs over 
^(CVemC^T')) and over aU elements of UomciVr, KG), which is Mx m- 

Of course, like in Algorithm 15.61 this can be made into a finite set of generators 
by taking the entries of 'i'o to be variables, taking / in a finite generating set of 
I{CVem{T')), expanding, and taking the coefficients of the monomials in ^"0. □ 

We have now reduced the ideals of our equivariant models to those for stars, and 
argued their relevance for statistical applications. The main missing ingredients 
for successful applications are equations for star models. These are very hard to 
come by: [5] posed several conjectures concerning these for the general Markov 
model, and special cases of these conjectures were proved in [H [131 [14]. For certain 
important equivariant models equations were found in [11 [16]. Roughly speaking, 
the less symmetry one imposes on the model, the harder it is to find equations. 
The following proposition offers some explanation for this. 

Proposition 5.9. Suppose that T is a star with (based) centre r. Let bi, . . . ,bs be 
representatives of the G-orbits on Br and denote by Gi the stabiliser of bi in G. 
Let Ci denote the cone of pure tensors in 0pgieaf(T)(^'^') — ^i"^)' ^''^^ denote by 
p the Reynolds operator for G. Then 



CYem{T)^ piCi) + ... + p{Cs), 
where the addition corresponds to taking the join of these varieties. 
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Proof. Elements in an open dense subset of CVem (T) look like 

s 

(g) Aprb = Y, E "^p^-dh 

be-Br pGlcaf (T) 8=1 geG/Gi pelcaf(T) 

i=l geG/Gi \j9elcaf(T) 




A h 



pelcaf(T) 



pelcaf(T) 



where Vip — Aprbi G ; the latter element clearly lies in the join p{Ci) + . . . + 
p{Cs). This argument can be reversed to show the opposite inclusion. □ 

This is particularly interesting in the case when for all internal vertices q, Bq is 
a single G-orbit. As usual, we may assume that T is a star, and then CVEM(r) is 
simply p{C) where C is the set of pure tensors in {8'peicaf(T) (^^) '^i^h H = Gt the 
stabiliser of some element b G Bq. p being a linear projection now means that the 
ideal may be computed by elimination theory, at least in principle. This applies to 
(1), (2), and (6) in Example O 

In the following section we record some further observations for abelian groups 

G. 



6. Abelian groups and toricness 

In this section we collect some results on the equivariant model for an abelian 
group G. The fact that all irreducible representations of G are one-dimensional 
makes G-equivariant models somewhat easier to analyse than general equivariant 
models. Recall that an element w in a G-representation is called a weight vector if it 
is a common eigenvector of all elements of G; in that case the function A : G ^ K* 
determined by gv = X{g)v is a character of G. We also say that G scales v by A. 
The following results are a slight generalisation of results in [51 [TB] . 

Proposition 6.1. Suppose that T is a star with (based) centre r, that G is abelian 
and that Br is a single G-orbit. Then CVemC?") is a toric variety. More specifically, 
there exist tori Sp in GL(l^) for p G leaf(r), diagonalised by certain bases of G- 
weight vectors such that ripGicaf(T) stabilises CVem (2^) with a dense orbit. 

Proof. Fix b £ Br. A typical element of CVemI^) looks like 

E 5 

geG/Gb \peicaf(T) / 

with Vp G V^*" =: Vp. As G is abehan, Vp is a G-module. Choose any basis of 
Vp that diagonalises G and let Sp be the associated torus in GL(Vl'), regarded 
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as a torus in GL(V^) acting trivially on a G-stable complement of in Vp. Set 
S :~ npeieaf(T) ^p- Then (sp)p € S sends the element above to 

E 5 SpVp , 

geG/Gt \ P I 

which again lies in CVemC?")- Moreover, as each Sp has a dense orbit on V^', S has 
a dense orbit on CVemC?"). □ 

Remark 6.2. If G is abelian and has fc orbits on V^, then CVEM(r) for a star T 
is a join of fc toric varieties. This fact is exploited in [3] for the strand-symmetric 
model, where fc = 2. 

Theorem 6.3. Suppose that G is an abelian group and that T is a G-tree in which 
G has a single orbit on all Bp with p S internal(T). Then CVem(2^) is a toric 
variety. More precisely, there exists a torus S acting linearly on L(T), stabilising 
CVemCT") with a dense orbit, and diagonalised by a tensor product of G-weight 
bases of the Vp,p £ leaf(T). 

There is a subtlety here: unlike in Proposition 16. II such a torus can in general 
not be found in IlpeioafCT) ^GL(y,)G. 

Proof. We proceed by induction. First, if T is the single edge pq, then CVem(T) = 
{Vp ® Vq)'^ = 0;^_|_^^o Vp[\] ® Vq[^A, wlicrc (A, /i) ranges over pairs of characters of 
G. Let 5 be a maximal torus in GL((Vp ® Vq)'^) which is diagonal with respect to a 
basis of pure tensors v\ ® V-\ with v\ €Vp and V-\ G Vq weight vectors of weights 
A, — A. View S" as a torus in GL{Vp ® Vq) acting trivially on all V^[A] ® Vq[^j\ with 
A + /i ^ 0. This S has the properties claimed in the theorem. 

Second, if T is a star, then the proposition above does the trick. Third, if T is 
neither a star nor an edge, then let q G internal(r) be any internal vertex of valency 
two. As in the proof ofTheorem ll.61 we may add such a vertex, if necessary, without 
changing CVem(T) — and in fact, if q is inserted between the internal vertices p, r, 
then, as Bp, Br are G-orbits, Vq := KG is sufficiently large. Write T = Ti *T2 at 
q, let Li ®p(=icaf(T )\g ^^'^ 1'^^ Si,S2 be the tori whose existence is claimed 
by the theorem for the G-trees T;. In particular, Si is diagonalised by the tensor 
product of a G-weight basis Gi of Li and a G-weight basis Di of Vq (such a basis is 
always orthogonal and may be chosen orthonormal). But since Vq is a permutation 
module with a single orbit, every weight occurs at most once in Vq, so that (after 
scaling) Di = D2 =: D. Now we let Si act on Li as follows: let Ci € Ci have 
G-weight A. If —A is not a weight in Vq, then SiCi :— Ci. If, on the other hand, 
—A is a weight in Vq, then it is the weight of a unique d € D, and we let Si scale 
Ci by the character with which it scales d^ Ci G L{Ti). Now S := Si x S2 acts on 
Li IS) L2 = L{T), and is diagonalised by the tensor product of Gi and G2. 

Finally we verify that S stabilises CVEM(r) with a dense orbit. To see this, 
observe that the map {Vq ® Li)'^ x (V, L2)*^ ^ {Li » ^2)"^ given by (^'i, ^'2) ^ 
X^beB, (^1 I ^) ® I b) is Si X S'2-equivariant: it sends {di ®ci,d2® C2), where 
di and Ci have opposite weight, to {di \ d2){ci (81C2), which scales with the same 5- 
characters by definition of the action of S. Since Si has a dense orbit on CVEM(7i), 
S has a dense orbit on CY-e,m{T). □ 
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Theorem 16.31 reduces the computation of the ideals of certain equivariant models 
to the combinatorics of toric varieties (where we do not require toric varieties to be 
normal). However, this combinatorics can be very intricate, and it requires great 
ingenuity to find explicit generators as in [16] . We conclude with an example. 

Example 6.4. First let T be a star with centre q and four leaves pi, . . . ,p4. Let 
G = Z2 = and Vp — KG for all vertices p ^ T, with basis G and induced 

form. For this G and Vp, a star with four leaves is the smallest G-star for which 
CVem(T) C L{Tf. 

We are free to choose any basis on L{T), so we pick the orthonornial product 
basis of the basis of KG diagonalising G. Let us denote this basis by {t, s) where 
t = 1 + X spans the trivial, and s — 1 — x the sign representation in KG. We will 
label the basis of L{T) given by the pure tensors 61 ® 62 ^3 (Xi 64 {bi E {t, s}) as 
follows: for a subset / C {1, 2, 3, 4} let 6/ = 61 (g) 62 63 (8" 64 where bi = s ii i E I , 
and hi ~ t otherwise. Then L{T)^ is spanned by all bj such that |/| is even. 

As HomciKG, KG) = K^, an equivariant representation of T is specified by 8 
parameters yi,Xi (i = 1,2,3,4) where Xi is dual to s and yi is dual to t in KG, 
and Ap.q = (xiS + yit) ® 1 + (yit — Xis) x (where Pi,P2,P?,iPi are the leaves). By 
Proposition 15.91 CVEM(r) = p{P) where P is the variety of pure tensors in L{T). 
Specifically, if ^ = {xi,yi)f^i is a representation, then 

^'t(A) = 2p((g)(.T,s + y,t)). 

i 

Let the variables dual to the basis (6/)/c{i.2,3.4} be denoted by xj. Then xj{'i'TiA)) — 
if |/| is odd, and 

xji^T{A))^(l[xi) [l[y-], 
\iei / / 

if |/| is even. Thinking of CVem(7') as a subvariety of L{T)^, for determining the 
ideal we need to consider only functions in the xj with |/| even. There are some 
obvious relations, namely, if /, J are two subsets of {1, 2, 3, 4} with even number of 
elements and with complements /'^, J'^, respectively, then 

//,,/ := xixj. - xjxjo 

is in the ideal of CVem(T). Note that it is enough to consider where every 
subset appears once as / or J, because fi^j + fj^j' = and if |/| = \J\ = 2, 

then fjj is nonzero only if / n J contains one element. Altogether it follows that 
the ideal generated by the //,j's is already generated by 

/0,{1,2}7 /0,{1,3}: /0,{1,4}- 

One can show that I{C'Vem{T)) is generated by the //,j's. Indeed, by [B] Theorem 
3.1] this boils down to showing that these //, j's correspond to a Markov basis for the 
module of Z-linear relations among the 8 vectors (a, b) £ {0, l}'^ x {0, 1}^ C N'^ x N'^ 
where ]a] := J^i'^i even and b = 1 — a with 1 = (1,1,1,1). We omit the 
combinatorial details here. 

To illustrate Theorem II. 6[ we now consider a G-tree T obtained by gluing to- 
gether two stars as above at one common leaf (see Figure [1]). Notice that by 
Remark O and Lemma EH CVem (r) = CVem(T') where T' is the tree T with 
vertex 4 removed and the centres of the two stars in T joined by an edge. 
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Figure 1. The spaced tree T. 



Algorithm 1 5 . 61 first identifies a vertex of valency 2; here vertex 4. We then write 
T — Ti * T2, with Ti the left and T2 the right star with four leaves each. By 
the above we know the ideals of CVEM(Ti). The content of Theorem 11.61 in this 
situation is that 



Because of the symmetry of the problem, we only consider the first summand. We 
keep the notation introduced above with respect to 7^: the variables on L{Ti)'^ will 
be a;/ (/ C {1, 2, 3, 4}, |/| even) and those on L{T2)'^ wiU be yi {I C {4, 5, 6, 7}, |/| 
even). Finally the variables on L{T)'^ will be zj where J C {1, 2, 3, 5, 6, 7} has an 
even number of elements; J corresponds to the basis vector 6j = 61 (g) 62 &3 ^ ^5 18) 
65 ® h'j where 6^ = s if i e J and bi — t otherwise. We also adopt the convention 
that xi,yi, 2;/ = if |/| is odd. 

The ideal of CVem(Ti) * L{T2)^ is generated by /' := /(CVem(Ti)') and certain 
2 X 2-minors, since both representations of Z2 occur with multiplicity one in V4. 
These minors are of the form 



where /i,/( C {1, 2, 3}, /2, /2 ^ {5,6,7} are all distinct and either all even or all 
odd. Next we show how to find generators of /'. The space L{T2)'^ is isomorphic 
to HomG(V4,0pgieaf(T2)\{4}) " ^^m,i with m = (4,4) and 1 = (1, 1). Similarly, 



/(cvEM(r)) = /(cvEM(ri) * L{T2f) + i{L{Tif * cvEM(r2)). 




.peloaf(T2)\{4} pelcaf(Ti)\{4} 





m 



with k = (4, 4) and m as above. 

Let *o e L{T2)^ be arbitrary and let * e L(T)^ . Then ^"^0 e HomG(^4,{8) 
L(Ti)'~^ and a straightforward computation shows that 



'pelcaf(Ti)\{4} ^p) — 
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To avoid clumsy notation, let us write xi for the function a:/(^'5'o) in the arguments 
(^'jVjfo). Then for example 

(4) ^0 =202/0 + Z{5^6}y{5,6} + 2{5,7}y{5,7} + 2{6,7}y{6,7} 

(5) ^{1,2,3,4} =2{i 2,3,5}y{4,5} + 2{i,2,3,6}2/{4,6} + ^{i, 2, 3, 7} 2/(4,7} 

+ 2{l,2,3,5,6,7}2/{4,5,6,7} 

(6) ^{1.2} =2{l,2}2/0 + 2{i,2,5.6}2/{5,6} + 2{i,2,5,7}2/{5,7} + 2{i,2,6,7}2/{6,7} 

and finally 

(7) ^{3,4} =2{3,5}2/{4,5} + 2{3^6}2/{4,6} + 2{3,7}2/{4,7} + 2{3^5,6,7}2/{4,5,6,7} ■ 

/' is then generated by all f^^ where / is in the ideal of CVem(7i). As observed 
before, this is the same as the ideal generated by all coefficients of monomials in 
the 2//'s. As an example let us consider /0,{i,2,} = a;0X{i^2.3,4} — 2;{i_2}2;{3,4}. Using 
the relations (l4])-(l7]), we get an expression in the z/'s and 2//'s. As a function on 
-Wk,m X -Mm,i it is cqual to 

(8) /0,{l,2}(**o) = i0i{l,2,3,4} - £{1,2}£{3,4} 

= (202(1,2,3,5} - 2{i,2}2{3,5})2/02/{4,5} 

+ (202(1^2,3,6} - 2{i,2}2{3^6})2/02/{4,6} 

+ (202(1^2,3,7} - 2{l,2}2{3,7})2/02/{4,7} 
+ (202(1,2,3,5,6,7} - 2{i,2}2{3,5,6,7})2/02/{4,5,6,7} 
+ (2{5,6}2{l,2,3,5} - 2{i,2,5,6}2{3,5})2/{4,5}2/{5,6} 
+ (2{5,7}2{l,2,3,5} - 2{l,2,5,7}2{3,5})2/{4,5}2/{5,7} 
+ (2{6,7}2{l,2,3,5} - 2{i,2,6,7}2{3,5})2/{4,5}2/{6,7} 
+ (2{5,6}2{l,2,3,6} - 2{l,2,5,6}2{3,6})2/{4,6}2/{5,6} 
+ (2{5,7}2{1,2,3,6} - 2{i,2,5,7}2{3,6})2/{4,6}2/{5,7} 
+ (2{6,7}2{1,2,3,6} - 2{i,2,6,7}2;{3,6})2/{4,6}2/{6,7} 
+ (2{5,6}2:{1,2,3,7} - 2;{i 2,5,6}2{3,7})2/{4,7}2/{5,6} 
+ (2{5,7}2:{l,2,3,7} - 2:{;i^2,5,7}2;{3,7})2/{4,7}2/{5,7} 
+ (2{6,7}2:{1,2,3,7} - 2:{l,2,6,7}2{3,7})2/{4,7}2/{6,7} 
+ (2:{5,6}2{l,2,3,5,6,7} - 2{l,2,5,6}2{3,5,6,7})2/{5,6}2/{4,5,6,7} 
+ (2{5,7}2{l,2,3,5,6,7} " 2{i,2,5,7}2{3,5,6,7})2/{5,7}2/{4,5,6,7} 

+ (2{6,7}2:{1,2,3,5,6,7} " 2{i,2,6,7}2{3,5,6,7} )2/{6,7}2/{4,5,6,7} • 

Every single coefficient of a monomial in the 2//'s then gives a generator for /'. It 
should be clear how to proceed in principle with the other //,,7's. So Algorithm [SH] 
calls itself twice, once for Ti and once for T2. 
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